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We present the results of a search for the flavor-changing neutral current decay B® — > fi + fx~ 
using a data set with integrated luminosity of 240 pb _1 of pp collisions at \fs = 1.96 TeV collected 
with the D0 detector in Run II of the Fermilab Tevatron collider. We find the upper limit on the 
branching fraction to be B(B { ^ —> fi + < 5.0 x 10 -7 at the 95% C.L. assuming no contributions 
from the decay B^ — > fi + fj,~ in the signal region. This limit is the most stringent upper bound on 
the branching fraction B° — ► fi + [J>~ to date. 

PACS numbers: 13.20.He, 12.15.Mm, 14.40.Nd 



The purely leptonic decays B® s — > i± + [i~ [l| are flavor- 
changing neutral current (FCNC) processes. In the stan- 
dard model (SM), these decays are forbidden at the tree 
level and proceed at a very low rate through higher- 
order diagrams. The SM leptonic branching fractions (B) 



were calculated including QCD corrections in Ref. |2|. 
The latest SM prediction '% is B{B° S -> il + n~) = 
(3.42 ± 0.54) x 10~ 9 , where the error is dominated by 
non-perturbative uncertainties. The leptonic branching 
fraction of the B® decay is suppressed by CKM matrix 
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elements \Vtd/Vt s \ 2 leading to a predicted SM branching 



fraction of (1.00±0.14) x lCT 10 . The best existing exper- 
imental bound for the branching fraction of B° s (£>°) is 
presently B{B° S (S?) < 7.5 x 1CT 7 (1.9 x 10" 7 ) 

at the 95% C.L. Q. 

The decay amplitude of B^ s — > ^ + /i~ can be signif- 
icantly enhanced in some extensions of the SM. For in- 
stance, in the type-II two-Higgs-doublet model (2HDM) 
the branching fraction depends only on the charged Higgs 
mass Mh+ and tan /3, the ratio of the two neutral Higgs 
field vacuum expectation values, with the branching frac- 
tion growing like (tan/3) 4 0. In the minimal super- 
symmetric standard model (MSSM), however, B(B° — > 
oc (tan/3) 6 , leading to an enhancement of up to 
three orders of magnitude Q compared to the SM, even 
if the MSSM with minimal flavor violation (MFV) is con- 
sidered, i.e., the CKM matrix is the only source of flavor 
violation. An observation of B® — > ji + [iT would then im- 
mediately lead to an upper bound on the heaviest mass 
in the MSSM Higgs sector if MFV applies. In minimal 
supergravity models, an enhancement of B(B® — ► /i + /i~) 
is correlated Q with a sizeable positive shift in (g — 2) p 
that also requires large tan/3. A large value of tan/3 
is theoretically well-motivated by grand unified theories 
(GUT) based on minimal SO(10). These models pre- 
dict large enhancements of B(B® — > (J. + fi~) as well [s|9J. 
Finally, FCNC decays of B® s are also sensitive to su- 
persymmetric models with non-minimal flavor violation 
structures such as the generic MSSM [l0| and i?-parity 
violating supersymmetry [ill ]. 

In this Letter we report on a search for the decay 
Bg —>■ using a data set of integrated luminosity 

of 240 pb~ x recorded with the D0 detector in the years 
2002-2004. Our mass resolution is not sufficient to read- 
ily separate B® from B® leptonic decays. For the final 
calculation of the upper limit on B(B® — > /i + ^i~) we as- 



sumed that there is no contribution from B^ 



decays in our search region due to its suppression by 
\Vtd/V t s\ 2 , which holds in all models with MFV. 

The D0 detector is described in detail elsewhere [l3). 
The main elements, relevant for this analysis, are the 
central tracking and muon detector systems. The central 
tracking system consists of a silicon microstrip tracker 
(SMT) and a central fiber tracker (CFT), both located 
within a 2 T superconducting solenoidal magnet. Lo- 
cated outside the calorimeter, the muon detector consists 
of a layer of tracking detectors and scintillation trigger 
counters in front of toroidal magnets (1.8 T), followed 
by two more similar layers behind the toroids, allowing 
for efficient muon detection out to rj of about ±2, where 
i] = — ln[tan(#/2)] is the pseudorapidity and 9 is the po- 
lar angle measured relative to the proton beam direction. 

Four versions of dimuon triggers were used in the data 
selection of this analysis. A trigger simulation was used 
to estimate the trigger efficiency for the signal and nor- 
malization samples. These efficiencies were also checked 



with data samples collected with single muon triggers. 
Event preselection started by requiring two muons of op- 
posite charge to be identified by extrapolating charged 
tracks reconstructed in the central tracking detectors 
to the muon detectors, and matching them with hits 
in the latter. The muons had to form a common sec- 
ondary 3D-vertex with an invariant mass to(/! + /i~) be- 
tween 4.5 and 7.0 GeV/c 2 and a x 2 per degree of free- 
dom of x 2 /d.o.f < 10. Each muon was required to have 
Pt > 2.5 GeV/c and < 2.0. Tracks that were matched 
to each muon were required to have at least three hits 
in the SMT and at least four hits in the CFT. To se- 
lect well-measured secondary vertices, we determined the 
two-dimensional decay length L xy in the plane transverse 
to the beamline, and required the uncertainty SL xy to be 

less than 0.15 mm. L xy was calculated as L xy — lvtx ^r ; 

where p^ is the transverse momentum of the candidate 
B® and l vtx represents the vector pointing from the pri- 
mary vertex to the secondary vertex. The error on the 
transverse decay length, SL xy , was calculated by taking 
into account the uncertainties in both the primary and 
secondary vertex positions. The primary vertex itself was 
found for each event using a beam-spot constrained fit as 
described in Ref. [Til ] . To ensure a similar pt dependence 
of the system in the signal and in the normaliza- 

tion channel, p^ had to be greater than 5 GeV/c. A total 
of 38,167 events survive these preselection requirements. 
The effects of these criteria on the number of events are 
shown in Table [I] 



TABLE I: Number of candidate events in data satisfying suc- 
cessive preselection requirements. 



Variable 



Mass (GeV/c z ) 
Muon quality 
X 2 /d.o.f of vertex 
Muon p T (GeV/c) 
Muon | ?; 
Tracking hits 
SL xy (mm) 

B° candidate Pt (GeV/c) 



Requirement # Candidates 



4.5 < m M+M - 



<7.0 405,307 
234,792 

< 10 146,982 

> 2.5 129,558 

< 2.0 125,679 
CFT > 4, SMT > 3 92,678 

< 0.15 90,935 

> 5.0 38,167 



For the final event selection, we required the candidate 
events to pass additional criteria. The long lifetime of the 
B® mesons allows us to reject random combinatoric back- 
ground. We therefore used the decay length significance 
L xy /SL xy as one of the discriminating variables, since it 
gives better discriminating power than the transverse de- 
cay length alone, as large values of L xy may originate due 
to large uncertainties. 

The fragmentation characteristics of the b quark are 
such that most of its momentum is carried by the b 
hadron. Thus the number of extra tracks near the B® 
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candidate tends to be small. The second discriminant 
was therefore an isolation variable, X, of the muon pair, 
defined as: 



track z^i? 



(i) 



Here, £ Pi i s the scalar sum over all tracks exclud- 

track i^B 

ing the muon pair within a cone of AT?. < 1 around 
the momentum vector p(fi + p,~) of the muon pair where 
ATI = V(A0) 2 + (A??) 2 . 

The final discriminating variable was the pointing an- 
gle a, defined as the angle between the momentum vector 
p(p + pr) of the muon pair and the vector l vtx between 
the primary and secondary vertices. This requirement 
ensured consistency between the direction of the decay 
vertex and the momentum vector of the B® candidate. 

An optimization based on these discriminating vari- 
ables was done on signal Monte Carlo (MC) events in 
the B° mass region 4.53 < M f , + fl - < 6.15 GeV/c 2 with 
m B o = 5369.6 ±2.4 MeV/c 2 and on data events 
in regions outside the signal window, i.e., in the side- 
bands. The mass scale throughout this analysis is shifted 
downward with respect to the world average B® mass by 
30 MeV/c 2 to compensate for the shift in the momen- 
tum scale of the D0 tracking system. The mass shift 
was found by linear interpolation to the B® mass of the 
measured mass shifts between the J/tp and the T reso- 
nances relative to their world average values The 
mass shift is smaller than the MC predicted mass reso- 
lution for two-body decays of a = 90 MeV/c 2 at the B® 
mass. 

In order to avoid biasing the optimization procedure, 
data candidates in the signal mass region were not exam- 
ined until completion of the analysis, and events in the 
sideband regions around the B® mass were used instead. 
The start (end) of the upper (lower) sideband was cho- 
sen such that they were at least 3a (270 MeV/c 2 ) away 
from the B® mass. The widths of the sidebands used for 
background estimation were chosen to be 6cr each. The 
size of the blind signal region was ±3er around the J5° 
mass. To determine the limit on the branching fraction, 
we used a smaller mass region of ±2cr. 

A random-grid search |l5| and an optimization pro- 
cedure 0] were used to find the optimal values of 
the discriminating variables, by maximizing the variable 

P = enil /(a/2 + y/Nblck). Here, e M/ l is the reconstruc- 
tion efficiency of the signal events relative to the prese- 
lection (estimated using MC), and A^ack is the expected 
number of background events interpolated from the side- 
bands. The constant a is the number of standard de- 
viations corresponding to the confidence level at which 
the signal hypothesis is tested. This constant a was set 
to 2.0, corresponding to about the 95% C.L. Figure 
shows the distribution of the three discriminating vari- 



ables after the preselection for signal MC events and 
data in the sideband regions. After optimization, we 
found the following values for the discriminating vari- 
ables and MC signal efficiencies relative to the preselected 
sample: L xy /SL xy > 18.5 (47.5%), T > 0.56 (97.4%), 
and a < 0.2 rad (83.4%). The combined efficiency for 
signal events to survive these three additional selection 
criteria, as measured relative to preselection criteria, is 
(38.6±0.7)%, where the error is due to limited MC statis- 
tics. A linear extrapolation of the sideband population 
for the whole data sample into the (±180 MeV/c 2 ) signal 
region yields an expected number of 3.7±1.1 background 
events. 

Upon examining the data in all mass regions, four 
events are observed in the signal region, entirely consis- 
tent with the background events as estimated from side- 
bands. We examined the four observed events in detail 
by studying various kinematic variables, e.g., px of the 
muons, isolation, etc., and found them to be compatible 
with background events. Figure |21 shows the remaining 
events populating the lower and upper sidebands as well 
as the signal region almost equally. 

In the absence of an apparent signal, a limit on the 
branching fraction B(B® — > /i + /i~) can be computed 
by normalizing the upper limit on the number of events 
in the B® signal region to the number of reconstructed 
B ± J/ip K ± events: 



N, 



B± 



B° 



B{B ± -► J/iI)( ! j,+ijl-)K ± ) 



R- 



(2) 



where 



• iV u i is the upper limit on the number of signal de- 
cays estimated from the number of observed events 
and expected background events. 



• N B ± is the number of observed B^ 
events. 



J/ipK d 



B i 

• e M /l and e^„ K are the efficiencies of the signal and 
normalization channels, obtained from MC simula- 
tions. 

• B(B ± -> J/ipin+fi-)^) is the product of the 
branching fractions B(B ± -> J/ipK ± ) = (1.00 ± 
0.04) x 10~ 3 and B(J/ip -► ^ + aO = (5.88±0.10) x 

lO" 2 E3. 

• fb^B a J fb->B u>d = 0.270 ± 0.034 is the fragmenta- 
tion ratio of a b quark producing a B® and a B Ui d 
meson. This ratio has been calculated using the 
latest world average fragmentation values |l2j for 
B® and B UJ i mesons, where the uncertainty on the 
ratio is conservatively calculated assuming a full 
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FIG. 1: Discriminating variables after the preselection for signal MC (solid line) and data events (dashed line) from the 
sidebands. The arrows indicate the discriminating values that were obtained after optimization. The normalization is done on 
the number of signal MC and sideband data events after preselection. 
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FIG. 2: Invariant mass of the remaining events of the full data 
sample after optimized requirements on the discriminating 
variables. 



FIG. 3: Invariant mass distribution for candidates in the nor- 
malization channel B — > J/4>K 



± 



anti-correlation among the individual B Ui d and B® 
fragmentation uncertainties. 

• R ■ e/iju/e^ i s tne branching fraction ratio R = 
B(Bj)/B(Bg) of B° s mesons decaying into two 
muons multiplied by the total detection efficiency 
ratio 17]. Any non-negligible contribution due to 
Bj decays (R > 0) would make the limit on the 
branching fraction B(B® — > as given in 

Eq. J2J) smaller. Our limit presented for B(B® — > 
fi + fi~) is therefore conservative. 

Using the B ± -> J/^JC* mode [3 

has the ad- 
vantage that the efficiencies to detect the p + ^i _ sys- 
tem in signal and normalization events are similar, and 
systematic effects tend to cancel. A pure sample of 
B^ — > J /tp K ± events was obtained by applying the fol- 
lowing selection criteria. The mass-constrained vertex fit 
of the two muons to form a J/if was required to have 
a x 2 /d.o.f. < 10, similar to the [i + [i~ vertex criterion 
in the B® — * search. The combined vertex fit of 

the J/ip and the additional #± (p T (-^ ± ) > 0.9 GeV/c) 
had to have \ 2 < 20 for three d.o.f.. The requirements 
on the three discriminating variables were also applied. 
The mass spectrum of the reconstructed B ± — > J/tp K ± 
for the full data sample after all analysis requirements 



is shown in Fig. |21 A fit using a Gaussian for the sig- 
nal and a second order polynomial for the background 
yields 741 ± 31 (stat) ± 22 (sys) B ± candidates, where 
the systematic uncertainty was estimated by varying the 
fit range, background and signal shape hypotheses. 

The pt distribution of the B ± in data has a slightly 
harder spectrum than that from MC. Therefore, MC 
events of the signal and normalization channels have 
been reweighted accordingly. In addition, the observed 
widths of known resonances (J/ip and T(IS')) are 

(27±4)% larger than predicted by MC. The ±2er signal 
mass region using the MC mass resolution therefore cor- 
responds to ±1.58cr when the data mass resolution is con- 
sidered, and the efficiency is corrected accordingly. To 
within errors, the MC correctly reproduces the efficiency 
of the cuts on the discriminating variables when applied 
to the normalization channel. 

The final corrected value for the efficiency ratio is then 

given by e^ K /efl = 0.247 ± 0.009 (stat) ± 0.017 (sys), 
where the first uncertainty is due to limited MC statistics 
and the second accounts for the B^ /B® lifetime ratio un- 
certainties and for uncertainties in data/MC differences. 
These differences include the pT-dcpcndcnt reweighting 
of MC events, signal mass width, the kaon track recon- 
struction efficiency and the effects of different trigger and 
muon identification efficiencies. All systematic uncer- 
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TABLE II: Relative uncertainties used in the calculation of 
an upper limit of B{B® — > ji + 



Source Relative Uncertainty [%] 

Number of B ± -* J/ipK ± events 5.1 
B(B ± -> J/ipK*) 4.0 
B(J/i> ->■ n + n~) 1.7 

fb-,B«l 'fb->B° d 12 ' 7 

Background uncertainty 29.7 



tainties entering the calculation of the branching fraction 
limit are listed in Table ITT1 

The statistical uncertainties on the background expec- 
tation, as well as the uncertainties on the efficiencies can 
be included into the limit calculation by integrating over 
probability functions that parameterize the uncertain- 
ties. We have used a prescription to construct a 
confidence interval with the Feldman and Cousins order- 
ing scheme. The expected background was modeled as a 
Gaussian distribution with its mean value equal to the 
expected number of background events and its standard 
deviation equal to the background uncertainty. The un- 
certainty on the number of B^ events as well as the un- 
certainties on the fragmentation ratio and branching frac- 
tions for — > J/ip(/j, + were added in quadra- 
ture to the efficiency uncertainties and parameterized as 
a Gaussian distribution. The resulting branching frac- 
tion limit |2jj including all the statistical and systematic 
uncertainties at a 95% (90%) C.L. is given by 

B(B° -> fi+fi-) < 5.0 x 10~ 7 (4.1 x 10~ 7 ). 

We also used a Bayesian approach with flat prior and 
Gaussian (smeared) uncertainties [13 and obtained the 
limit of B(B a s -> fi+ fx~) < 5.1 x 10" 7 (4.1 x 10~ 7 ) at 
the 95% (90%) C.L. This new result is presently the most 
stringent bound on B(B®-^ /i + /j~), improving the pre- 
viously published value [J] and can be used to constrain 
models of new physics beyond the SM. 
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